RecountDB: a database of mapped and count corrected transcribed sequences
نویسندگان
چکیده
The field of gene expression analysis continues to benefit from next-generation sequencing generated data, which enables transcripts to be measured with unmatched accuracy and resolution. But the high-throughput reads from these technologies also contain many errors, which can compromise the ability to accurately detect and quantify rare transcripts. Fortunately, techniques exist to ameliorate the affects of sequencer error. We present RecountDB, a secondary database derived from primary data in NCBI's short read archive. RecountDB holds sequence counts from RNA-seq and 5' capped transcription start site experiments, corrected and mapped to the relevant genome. Via a searchable and browseable interface users can obtain corrected data in formats useful for transcriptomic analysis. The database is currently populated with 2265 entries from 45 organisms and continuously growing. RecountDB is publicly available at: http://recountdb.cbrc.jp.
منابع مشابه
PHYLOGENETIC RELATIONSHIPS BETWEEN IRANIAN ISOLATES OF MICROSPHAERA AND ERYSIPHE S. LAT. BASED ON rDNA INTERNAL TRANSCRIBED SPACERS SEQUENCES
To study the phylogenetic relationships between Erysiphe s. lat. and Microsphaera, the nucleotide sequences of internal transcribed spacers ofrDNA including 5.8S rDNA gene were determined for 23 taxa. The results showed that Erysiphe. section Erysiphe and Microsphaera are closely related and clustered together with strong bootstrap support (100%). All oftaxa belonging to this group produce coni...
متن کاملMolecular characterization of Rhipicephalus (Boophilus) annulatus from Iran by sequences of cytochrome c oxidase subunit I (COI) and the second internal transcribed spacer (ITS2)
Background: Traditionally, morphological features of Rhipicephalus (Boophilus) annulatus from closely-related ticks have been considered for their identification and differentiation. However, it is difficult and requires expertise in order to accurately identify and differentiate engorged female ticks and some developmental stages such as larva and nymph from other similar ticks. Hence, molecul...
متن کاملMolecular Identification of Rare Clinical Mycobacteria by Application of 16S-23S Spacer Region Sequencing
Objective(s) In addition to several molecular methods and in particular 16S rDNA analysis, the application of a more discriminatory genetic marker, i.e., 16S-23S internal transcribed spacer gene sequence has had a great impact on identification and classification of mycobacteria. In the current study we aimed to apply this sequencing power to conclusive identification of some Iranian clinical ...
متن کاملP87: The Role of the Long Non-Coding RNA Sequences (LncRNAs) in Neurological Disorders
Precise interpretation of the transcriptome sequences in the several species showed that the major part of genome has been transcribed; however, just a few amounts of the transcription sequences have open-reading frames which are conversed during the evolution. So, it is unlikely that many of the transcribed sequences code the proteins. Among the all human non-coding transcripts, at least 10000...
متن کاملA generalization of Profile Hidden Markov Model (PHMM) using one-by-one dependency between sequences
The Profile Hidden Markov Model (PHMM) can be poor at capturing dependency between observations because of the statistical assumptions it makes. To overcome this limitation, the dependency between residues in a multiple sequence alignment (MSA) which is the representative of a PHMM can be combined with the PHMM. Based on the fact that sequences appearing in the final MSA are written based on th...
متن کامل